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TITLE OF THE INVENTION 
SPEECH ENCODING METHOD^ SPEECH DECODING METHOD 
AND ELECTRONIC APPARATUS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 This application is based upon and claims the 

benefit of priority from the prior Japanese Patent 
Application No. 2000-320679, filed on October 20, 2000; 
the entire contents of which are incorporated herein by 
reference • 

10 BACKGROUND OF THE INVENTION 

1, Field of the Invention 

The present invention relates to a speech encoding 
method and speech decoding method which are used to 
compression-encode and decode speech signals, audio 
15 signals, and the like, 

2, Description of the Background Art 

As a method of compression-encoding speech 
signals, a CELP (Code-Excited Linear Prediction) scheme 
is known ("Code-Excited Linear Prediction (CELP): 

20 High-quality Speech at Very Low Rates "Rroc. ICASSP 

'85, 25, pp, 937 - 940, 1985), 

According to characteristic features of the CELP 
scheme, modeling of a speech signal is performed 
separately for a synthesis filter and an excitation 

25 signal for driving the synthesis filter, and distortion 

is evaluated in accordance with the level of 
a perceptually weighted speech signal in encoding 
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the excitation signal^ thereby making it difficult to 
perceive encoding distortion. A synthesized speech 
signal after encoding is generated by passing the 
excitation signal through the synthesis filter. The 
5 excitation signal is generated by combining two code 

vectors, i.e., an adaptive code vector generated from 
an adaptive codebook storing past excitation signals 
and a stochastic vector generated from a stochastic 
codebook . 

10 An adaptive code vector mainly represents 

repetition of a waveform based on a pitch period as 
a feature of an excitation signal in a voiced speech 
interval. A stochastic code vector contains 
a component for compensating for a component contained 

15 in an excitation signal which cannot be expressed 

by an adaptive code vector, and is used to make 
a synthesized speech signal more natural. 

An adaptive codebook is a codebook using the fact 
that a repeating waveform based on a pitch period of 

20 an excitation signal is similar to the repeating 

waveform of an immediately preceding excitation signal. 
More specifically, past excitation signals are 
stored in the adaptive codebook without any changes, 
and a past excitation signal is extracted from the 

25 adaptive codebook by an amount corresponding to a pitch 

period. The vector obtained by repeating the extracted 
signal with a pitch interval at a pitch period up to 
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a signal interval is used as an adaptive code vector. 
As described above, according to the conventional 
adaptive codebook, the current adaptive code vector is 
obtained by directly repeating an excitation signal 
5 used in the past* In this conventional method, if 

the encoding bit rate is decreased to about 4 kbits/s, 
since an insufficient number of bits are assigned to 
express an excitation signal, distortion due to 
encoding is clearly perceived. As a consequence, the 

10 speech becomes unclear or noisy. That is, the sound 

quality considerably deteriorates. Demands have 
therefore arisen for a high-efficiency encoding scheme 
that can generate synthesized speech with high quality 
even if the bit rate is decreased, 

15 As described above, in the conventional speech 

encoding method, it is difficult to obtain synthesized 
speech with high quality at a low bit rate. 

It is an object of the present invention to 
provide a speech encoding method/speech decoding method 

20 which can generate synthesized speech with high quality 

even at a low bit rate. 

BRIEF SUMMARY OF THE INVENTION 
The present inventor has given special 
consideration to the fact that in pitch period 

25 components contained in a voiced speech signal, low 

frequency components exhibit repetition with a stronger 
correlation than high frequency components in terms of 



frequency. That is, pitch repetition components in a 
low frequency band tend to change slowly, whereas pitch 
repetition components in a high frequency band tend to 
change quickly. 

In consideration of the characteristics of the 
pitch period components contained in the speech signal, 
therefore, the degree of contribution to a better 
expression of an excitation signal by an obtained 
adaptive code vector is generally higher on the 
low-frequency side than on the high-frequency side. 
That is, excitation signals in a low frequency band can 
be stored in an adaptive codebook and reused more 
effectively than excitation signals in a high frequency 
band. Therefore, the conventional method is not 
necessarily effective, in which excitation signals in 
all frequency bands are stored in an adaptive codebook 
in the same manner. 

The present invention has been made in 
consideration of the general tendency that the 
contributions of adaptive code vectors in different 
frequency bands vary, and the contributions of adaptive 
code vectors decrease with an increase in frequency. 

Synthesized speech with high quality can be 
obtained and excellent synthesized speech can be 
obtained even at a low bit rate by changing 
characteristics depending on such frequency bands, 
i.e., updating an adaptive codebook by using an 
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excitation signal after modification by excitation 
filter processing (adjusting an output in accordance 
with a frequency band) . 

According to the present invention, there is 
5 provided a speech encoding method of generating a 

synthesized speech signal by using an excitation signal 
generated by using an adaptive codebook storing a past 
excitation signal, comprising modifying an excitation 
signal used to generate a synthesized speech signal by 

10 filtering, and storing the modified excitation signal 

in the adaptive codebook* 

A speech encoding/decoding method is provided, 
which can synthesize speech with high quality by 
storing an excitation signal modified by predetermined 

15 filter processing in an adaptive codebook instead of 

storing an excitation signal in the adaptive codebook 
without any modification as in the conventional method. 

As described above, since an adaptive code vector 
in a lower frequency band contributes more to an 

20 excitation signal, low-pass characteristics are 

preferably provided. An excitation signal can be 
generated by using a first code vector obtained from 
an adaptive codebook (first codebook) reflecting 
periodicity and a second code vector (e.g., 

25 a stochastic code vector) obtained from another kind 

of codebook (a second codebook, e.g., a stochastic 
codebook) . However, the present invention is not 
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limited to the stochastic codebook, and the number of 
codebooks used is not limited to two; an excitation 
signal can be obtained from a plurality of codebooks 
including an adaptive codebook. 
5 For example, the present invention can be 

implemented by a speech encoding method of generating 
a synthesized speech signal by using an excitation 
signal generated by using a first code vector obtained 
from an adaptive codebook storing a past excitation 
10 signal and a second code vector obtained from 

a predetermined codebook (e.g., a stochastic codebook). 
This speech encoding method comprises selecting code 
information representing a first code vector by using 
the adaptive codebook so as to reduce perceptually 
15 weighted distortion between a target vector obtained 

from an input speech signal and a synthesized vector 
obtained by synthesizing candidate vectors of the first 
code vector; selecting code information representing 
a second code vector from the codebook so as to reduce 
2 0 perceptually weighted distortion of the synthesized 

speech signal; generating an excitation signal by using 
the selected first and second code vectors; modifying 
the generated excitation signal by filter processing; 
and storing the modified excitation signal in the 
25 adaptive codebook. 

When an excitation signal is to be generated from 
an adaptive code vector obtained from an adaptive 
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codebook and a stochastic code vector obtained from 
a stochastic codebook, an excitation signal before 
modification is given by, for example, an excitation 
vector u expressed by the following equation, and is 
5 input to a synthesis filter to obtain synthesized 

speech. Note that the excitation signal is not limited 
to this . 

u=GOxO+Glxl 

where u is an excitation vector, xO is an adaptive code 
10 vector, xl is a stochastic code vector, GO is the gain 

of the adaptive code vector, and Gl is the gain of the 
stochastic code vector. 

Filters with various conditions can be used for 
filter processing to be performed for this excitation 
15 signal before modification. For example, excitation 

filter processing is performed for the excitation 
signal before modification by using a recursive filter 
expressed by R(z) = 1/(1 - klz"^) (kl: filter 
coefficient) in a z-transform domain, and the result is 
2 0 stored as latest data in the adaptive codebook. 

The excitation vector modified by using such 
filter processing is given by 

v(n)=u(n)+klv(n-l) 
where v is the modified excitation vector, u(n) is the 
25 current excitation signal, v{n) is the modified 

excitation signal, and kl is a filter coefficient. 

Note that this excitation filter is not limited to 
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a single-order recursive filter, and a multi-order 
filter or non-recursive filter may be used. 

In addition, characteristics of an excitation 
filter may change depending on encoding information 
5 (synthesis filter information, pitch period, gain 

information, and the like or input speech signal). In 
this case, the excitation signal may remain the same 
before and after modification depending on conditions. 
The present invention can be applied to an 
10 electronic apparatus designed to perform digital speech 

processing, e.g., a handyphone, portable terminal, or 
personal computer with speech processing. 

According to the present invention, there is 
provided an electronic apparatus comprising a speech 
15 encoder which executes the above speech encoding 

method, and a speech input device (a direct speech 
input device such as a microphone or an input device 
which inputs a speech signal that is externally 
supplied) for supplying a speech signal to the speech 
2 0 encoder. 

In addition, according to the present invention, 
there is provided an electronic apparatus comprising a 
speech decoder which executes the above speech decoding 
method for the speech signal encoded by the above 
25 speech encoding method, and a speech output device 

(a direct sound device such as a loudspeaker or 
a speech supply device which supplies a speech signal 
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to an external apparatus) for outputting a speech 
signal from the speech decoder. 

If an electronic apparatus includes both an 
encoder and a decoder, the apparatus can encode and 
5 decode speech signals. If, however, decoding is not 

required, the apparatus may include only an encoder 
together with another means necessary therefor. 
If only decoding is required, the apparatus may include 
only a decoder together with another means necessary 
10 therefor. 

A handyphone requires both an encoding function 
and a decoding function because it transmits /receives 
signals to/from a remote apparatus. 

In base stations and relay stations constituting 
15 a telephone network, analog and digital lines must be 

connected to each other in some cases. In such cases 
as well, since encoded speech signals are supplied from 
the digital line side, and analog speech signals before 
encoding are supplied from the analog line side, 
20 encoding and decoding must be performed for the 

respective operations. Therefore, both an encoding 
function and a decoding function are required. 
The present invention can also be applied to an 
electronic apparatus designed to receive a speech 
25 signal from an external apparatus and return the signal 

to the external apparatus or transfer it to another 
apparatus upon encoding it. 
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Additional objects and advantages of the invention 
will be set forth in the description which follows, and 
in part will be obvious from the description, or may 
be learned by practice of the invention. The objects 
5 and advantages of the invention may be realized and 

obtained by means of the instrumentalities and combina- 
tions particularly pointed out hereinafter. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING 
The accompanying drawings, which are incorporated 
10 in and constitute a part of the specification, illus- 

trate presently preferred embodiments of the invention, 
and together with the general description given above 
and the detailed description of the preferred embodi- 
ments given below, serve to explain the principles of 
15 the invention, 

FIG. 1 is a block diagram showing speech encoding 
according to an embodiment of the present invention; 

FIG. 2 is a block diagram showing an excitation 
filter according to the embodiment of the present 
20 invention; 

FIG. 3 is a view for explaining an adaptive 
codebook according to the embodiment of the present 
invention; 

FIG. 4 is a block diagram showing speech decoding 
25 according to the embodiment of the present invention; 

FIG. 5 is a view for explaining the function of 
the excitation filter according to the embodiment of 
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the present invention; 

FIG. 6 is a block diagram showing an excitation 
filter according to the embodiment of the present 
invention; 

5 FIG. 7 is a block diagram showing an excitation 

filter according to the embodiment of the present 
invention; and 

FIG- 8 is a block diagram showing an excitation 
filter according to the embodiment of the present 

10 invention. 

DETAILED DESCRIPTION OF THE INVENTION 
An embodiment of the present invention will be 
described with reference to the views of the 
accompanying drawing. FIG. 1 is a schematic block 

15 diagram showing a speech encoding method in this 

embodiment of the present invention. An input speech 
signal input from a speech input device (not shown) 
such as a microphone is A/D-converted and processed in 
units of frames each corresponding to a predetermined 

20 period of time. An LPC analyzer 101 analyzes the 

framed input speech signal to extract linear predictive 
coefficients (LPC coefficients). A synthesis filter 
information encoder 102 encodes the extracted LPC 
coefficients and outputs synthesis filter information A 

25 to a multiplexer 103. The linear predictive 

coefficients are used as synthesis filter coefficients 
(a(i): the order of a filter is set to, for example. 
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10, as needed) of a synthesis filter section 104. 
Subsequently, for example, each frame is divided into 
subframes corresponding to predetermined time intervals 
to obtain pitch period information L, stochastic code 
5 C, and gain information G. An adaptive codebook 105 

stores past excitation signals (past excitation signals 
modified by filter processing in the present 
invention). Upon reception of a pitch period as 
a candidate, the adaptive codebook 105 retraces by 
10 a length corresponding to the pitch period and extracts 

an excitation signal. The adaptive codebook 105 
generates an adaptive code vector by repeating this 
signal . 

In searching for a pitch period, a perceptually 
15 weighted distortion computation section 109 calculates 

the waveform distortion caused when the synthesis 
filter section 104 synthesizes an adaptive code vector 
corresponding to a pitch period candidate, and a code 
selector 106 searches for a pitch period in which the 
2 0 distortion of the perceptually weighted synthesized 

waveform is reduced more. Although the value obtained 
by open loop pitch analysis on a frame basis can be 
used as the initial value of a candidate pitch, the 
present invention is not limited to this. 
25 The pitch period determined by the adaptive 

codebook search is converted into the pitch period 
information L and output to the multiplexer 103. 
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A stochastic codebook 107 outputs a stochastic 
vector corresponding to the supplied stochastic code as 
a stochastic code vector candidate. In some scheme, a 
stochastic codebook is structured so as not to directly 
5 store stochastic code vectors. For example, a scheme 

using an Algebraic codebook is available. This 
Algebraic codebook is designed to express a code vector 
by a combination of pulse position information and 
polarity information with the amplitudes of a 
10 predetermined number of pulses being limited to +1 

and -1. According to characteristic features of the 
Algebraic codebook, a codebook can be expressed by 
a small memory capacity because any code vectors 
themselves need not be stored, and stochastic 
15 components contained in excitation information can be 

expressed with relatively high quality in spite of 
a small calculation amount required for code vector 
selection. 

A scheme using an Algebraic codebook to encode 
20 excitation signals is called an ACELP scheme or 

ACELP-based scheme and known as a scheme of obtaining 
a synthesized speech with little distortion. 

In searching for the stochastic code C, the 
perceptually weighted distortion computation section 
25 109 computes the perceptually weighted distortion 

contained in the waveform formed when a stochastic code 
vector corresponding to a stochastic code candidate is 
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synthesized by the synthesis filter section 104, and 
the code selector 106 searches for a stochastic code 
with which the distortion of this perceptually weighted 
synthesized waveform is reduced more. The found 
5 stochastic code C is output to the multiplexer 103. 

In this embodiment, the expression "stochastic 
codebook" is used. Obviously, however, a stochastic 
code vector expressed by this codebook need not always 
be stochastic. For example, this code vector may be 
10 a pulse excitation code vector as in an Algebraic 

codebook. 

A gain codebook 108 stores candidates for a gain 
GO used for an adaptive code vector and a gain Gl used 
for a stochastic code vector. For example, in 

15 searching for a gain code, the perceptually weighted 

distortion computation section 10 9 computes the 
perceptually weighted distortion contained in the 
waveform formed when the excitation code vector 
obtained by adding the adaptive code vector and 

2 0 stochastic code vector multiplied by gain candidates, 

respectively, is synthesized by the synthesis filter. 
The code selector 106 searches for a gain code with 
which the distortion of the perceptually weighted 
synthesized waveform is reduced more. 

25 The found gain code G is output to the multiplexer 

103. Various methods can be used to determine the 
above pitch period information L, stochastic code C, 
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and gain information G, For example, the following 
method can be used. 

The pitch period information L is obtained by 
an adaptive codebook search (adaptive code vector). 
5 The stochastic code C (stochastic code vector) is then 

obtained by making a stochastic codebook search so as 
to reduce the difference between the target vector and 
the vector obtained by multiplying the obtained 
adaptive code vector by a temporary gain (e.g., optimal 
10 gain). The gain information G (gain code vector) is 

obtained by making a gain codebook search using the 
obtained adaptive code vector and stochastic code 
vector. 

Apparently, the present invention is not limited 
15 to the above method. By using the pitch period 

information L, stochastic code C, and gain information 
G found in this manner, an excitation signal 
(excitation vector) u is generated according to 
equation ( 1 ) : 
20 u=GOxO+Glxl "•(!) 

where xO is the adaptive code vector obtained from 
the adaptive codebook 105 in correspondence with 
the pitch period information L, xl is the stochastic 
code vector obtained from the stochastic codebook 107 
25 in correspondence with the stochastic code C, GO is 

a gain which is obtained from the gain codebook 108 
in correspondence with the gain information G 



16 - 



and multiplied with the adaptive code vector in 
a multiplier 111, and Gl is a gain which is obtained 
from the gain codebook 108 in correspondence with the 
gain information G and multiplied with the stochastic 
5 code vector in a multiplier 112. The outputs of the 

multipliers 111 and 112 are added by an adder 113. 

The synthesis filter section 104 generates a 
synthesized speech by performing synthesis filtering 
expressed as l/A(z):A(z) = 1 + Sa(i)z-^ where a(i) is 

10 a synthesis filter coefficient (synthesis filter 

information A) in a z-transform domain with respect to 
the input of the excitation signal u obtained in this 
manner. This synthesized speech and input speech are 
subtracted from each other in an adder 114, and the 

15 above various selection/determination steps are then 

performed to reduce the difference, i.e., the 
distortion of the perceptually weighted synthesized 
waveform calculated by the perceptually weighted 
distortion computation section 109. 

2 0 The obtained excitation vector u is modified (or 

corrected) by the excitation filter 110 and stored in 
the adaptive codebook 105. Various methods can be used 
for this modification (or correction). For example, 
the vector can be modified by directly filtering it 

2 5 using an excitation filter having predetermined 

characteristics. As this excitation filter, for 
example, a single-order recursive filter expressed by 



equation (2) given below can be used: 

R(z)=l/(l-klz-l) ...(2) 

where kl is a filter coefficient. 

When an excitation filter having such output 

characteristics is used, an excitation signal v(n) 

after modification can be given by 

v(n)=u(n)+klv(n-l) '"(S) 

where u(n) is the excitation signal before 

modification, v(n) is the excitation signal after 

modification (n = 0, N-1, where N is the order of 

an excitation vector), and kl is a filter coefficient. 

FIG, 2 schematically shows processing by this 
excitation filter. The input excitation signal u(n) is 
input to an excitation filter 210 including a delay 
device 211, multiplier 212, and adder 213. In the 
excitation filter 210, the multiplier 212 multiplies a 
signal v(n-l), obtained by delaying the output signal 
v(n) from the excitation filter using the delay device 
211, by the filter coefficient kl, and the adder 213 
then adds the excitation signal u(n) to the product, 
thereby outputting the resultant signal as the modified 
excitation signal v(n). 

As described above, since a better effect can be 
obtained by increasing the degree of contribution in a 
low frequency band, a better effect can be obtained by 
providing low-pass characteristics. According to 
experiments, a value satisfying 0 < kl < 0.25 or the 



like is preferably used. The excitation signal v(n) 
modified in this manner is stored as latest information 
in the adaptive codebook. The adaptive codebook is 
updated by being shifted by N samples as a whole so as 
to discard the oldest excitation signal data and store 
the latest excitation signal data. The latest data is 
added in this manner. FIG. 3 is a schematic view 
showing this state. The adaptive codebook before 
update operation is made up of v(-K) v{-K+l ) , 
v(-K+N-l)v(--K+N)v(-K+N+l) , v(-2)v(-l), where N is 

the number of excitation vectors and K is the number of 
excitation signal data stored in the adaptive codebook. 

The oldest excitation signal is v(-K)v(-K+l) , , 

v(-K+N-l), which is discarded. The data 

"v(0)v(l), , v(N-l)" obtained from the latest 

excitation signal "u(0)u(l), u(N-l)" before 

modification by excitation filtering [v(n) = u(n) + 
klv(n-l): (n = 0, N-1)] is stored in the adaptive 

codebook as the latest data. 

The synthesis filter information A, pitch period 
information L, stochastic code C, and gain information 
G obtained by the above encoding method are 
multiplexed, and the multiplexed encoded output is 
sent out. 

Decoding to be performed upon reception of 
this encoded information will be described below 
with reference to FIG. 4. A demultiplexer 401 
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demultiplexes the encoded input to obtain the synthesis 
filter information A, linear predictive pitch 
period information L, stochastic code C, and gain 
information G. These pieces of information are 
5 respectively sent out to a synthesis filter information 

decoder 402, adaptive codebook 403, stochastic codebook 
4 04, and gain codebook 405. 

The synthesis filter information decoder 4 02 
obtains a linear predictive coefficient (LPC) on the 

10 basis of the obtained synthesis filter information A, 

reconstructs the same LPC coefficient as that on the 
encoding side, and sends out the LPC coefficient to 
a synthesis filter section 406. The adaptive codebook 
403 stores past excitation signals like the codebook on 

15 the encoding side. The adaptive codebook 403 retraces 

from the latest signal by a length corresponding to 
the pitch period L and extracts an excitation signal. 
The adaptive codebook 4 03 generate an adaptive code 
vector by repeating this signal. 

20 The stochastic codebook 404 outputs a stochastic 

code vector corresponding to the stochastic code C on 
the basis of the code C. The gain codebook 405 outputs 
the gain GO for an adaptive code vector and the gain Gl 
for a stochastic code vector on the basis of the gain 

2 5 code G. 

The adaptive code vector obtained in the above 
manner is multiplied by the gain GO in a multiplier 
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40 8, and the stochastic code vector is multiplied by 
the gain Gl in a multiplier 409- These vectors are 
then added by an adder 410, and the resultant signal is 
input as the excitation signal u to a synthesis filter 
5 section 406. This operation is equivalent to equation 

1 in encoding operation. The synthesis filter section 
406 performs synthesis filter processing represented by 
1/A(z) for the input of the excitation signal vector 
(vector obtained by multiplying the respective vectors 

10 by gains) based on the adaptive code vector and 

stochastic code vector in the same manner as on the 
encoding side, thereby generating a synthesized speech. 

Note that an excitation signal v modified by an 
excitation filter 407 on the basis of the generated 

15 excitation signal u is stored as latest data in the 

adaptive codebook as in encoding operation. That is, 
the adaptive codebook having identical information to 
that on the encoding side is also held on the decoding 
side. By storing the excitation signal modified by the 

20 excitation filter in the adaptive codebook on the 

decoding side as well, a speech signal with little 
perceptual distortion, obtained on the encoding side, 
can be faithfully reproduced. 

The functional role of the excitation filter in 

2 5 encoding/decoding operation of the present invention 

will be described with reference to FIG. 5. Referring 
to FIG. 5, reference symbol (a) denotes the time 
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waveform of an excitation signal before modification; 

(b), the time waveform of an excitation signal after 
modification using an excitation filter; and (c) and 

(d), amplitude characteristics of the excitation signal 
5 (a) and modified excitation signal (b) on the frequency 

axis . 

As indicated by the dashed line, the frequency 
amplitude of the excitation signal u before 
modification using an excitation filter is almost flat 

10 without any tilt on average. In contrast to this, the 

frequency amplitude of the excitation signal v modified 
by the excitation filter 110 is not flat on average 
but has a tilt, exhibiting a higher amplitude in 
a low-frequency region. This indicates that the 

15 frequency characteristics of the excitation filter are 

equivalent to those represented by the dashed line 
indicated by "(d)" in FIG. 5. In general, this filter 
has low-pass characteristics. 

As described above, an adaptive code vector 

2 0 contributes more to better expression of an excitation 

source in a low-frequency region, and hence an 
excitation filter having such characteristics is 
preferably used to realize high quality. In addition, 
the power of an excitation signal having passed through 

2 5 the filter preferably remains the same. In this case, 

an excitation filter may be formed as follows: 

R(z)=bO/(l-blz-i) 
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where BO and bl are filter coefficients. Note that 
bO + bl = 1. 

By using an excitation filter having such output 
characteristics, the excitation signal v(n) after 
5 modification can be expressed by 

V ( n ) =bOu ( n ) +blv ( n- 1 ) 
FIG* 6 schematically shows processing by this 
excitation filter. An excitation filter 610 includes a 
delay section 611, first multiplier 612, adder 613, and 
10 second multiplier 614. The delay section 611 delays 

the output signal v(n) from the excitation filter by 
one sampling cycle to obtain a signal v(n-l). The 
first multiplier 612 then multiplies the signal v(n~l) 
by the filter coefficient bl. The adder 613 adds the 
15 resultant signal to the signal obtained by multiplying 

the excitation signal u(n) by the filter coefficient 
bO using the second multiplier 614, and outputs 
the resultant signal as the modified excitation 
signal v(n) . In this case as well, a value satisfying 
20 0 < bl < 0.25 or the like is preferably set to realize 

low-pass characteristics . 

The excitation filter to be used is not limited to 
the above recursive filter, and the present invention 
can use a non-recursive filter like the one expressed 
25 by 

R(z)=l+k2z-l 
where k2 is a filter coefficient. 
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In this case, an excitation signal v(n) after 
modification which is obtained by inputting the 
excitation signal u to the excitation filter is given 
by 

5 v(n)=u(n)+k2u(n-l ) 

FIG. 7 schematically shows processing by this 
excitation filter • 

An excitation filter 710 includes a delay section 
711, multiplier 712, and adder 713. The delay section 
10 711 delays the excitation signal v(n) by one sampling 

cycle to obtain a signal u(n-l). The first multiplier 
712 then multiplies the signal u(n-l) by a filter 
coefficient k2. The adder 713 adds the excitation 
signal u(n) to the resultant signal, and outputs the 
15 resultant signal as the modified excitation signal 

v(n) . 

As described above, since a better effect can be 
obtained by increasing the degree of contribution in a 
low frequency band, a better effect can be obtained by 

2 0 providing low-pass characteristics. According to 

experiments, a value satisfying 0 < k2 < 0.25 or the 
like is preferably set. In this case as well, the gain 
of the excitation filter can be adjusted. In this 
case, the following excitation filter may be used: 

25 R( z)=cO+clz-l 

where cO and cl are filter coefficients. 

In this case, the excitation signal v(n) after 
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modification which is obtained by inputting the 
excitation signal u to the excitation filter is given 
by 

v(n)=cOu(n)+clu(n-l) 
5 The gain of the excitation filter can be set to 1 

by setting cO + cl = 1. In this case as well, as 
described above, since a better effect can be obtained 
by increasing the degree of contribution in a low 
frequency band, a better effect can be obtained by 

10 providing low-pass characteristics for the excitation 

filter. A value satisfying 0 < (cl/cO) < 0.25 or the 
like is preferably set. 

FIG. 8 schematically shows processing by this 
excitation filter. An excitation filter 810 includes a 

15 delay section 811, first multiplier 812, adder 813, and 

second multiplier 814. The delay section 811 delays 
the excitation signal v(n) by one sampling cycle to 
obtain the signal u(n - 1). The first multiplier 812 
multiplies the signal u(n - 1) by a filter coefficient 

20 cl. The adder 813 then adds the resultant signal to 

the signal obtained by multiplying the excitation 
signal u(n) by a filter coefficient cO using the second 
multiplier 814, and outputs the resultant signal as the 
modified excitation signal v(n). 

25 The excitation filter need not have fixed 

characteristics. A plurality of excitation filters 
having different characteristics may be selectively 



- 25 - 



used, or an excitation filter having variable 
characteristics, e.g., an excitation filter capable of 
varying the value of the filter coef f icient ( s ) may be 
used. Note that information transfer must be performed 
5 to allow the use of excitation filters having the same 

characteristics on the encoding and decoding sides. 

For example, a method of changing the filter 
characteristics of an excitation filter by using the 
encoded information of a speech signal is available. 

10 A mechanism of making the filter characteristics of the 

excitation filter shown in FIG. 1 adaptive on the basis 
of present or past encoded information (A, L, G, and 
the like) can be used. In this case, a filter 
characteristic R(f(y), z): f(y) of the excitation 

15 filter is a function of a variable y, and y can be 

expressed as present or past encoded information. 
Alternatively, excitation filters can be switched by 
selecting one set of excitation filter coefficients 
from a plurality of sets of excitation filter 

20 coefficients. 

By switching the characteristics of an excitation 
filter on the basis of the encoded information of 
speech, an excitation filter can be adaptively used 
in accordance with the features of a speech signal. 

2 5 In addition, there is no need to send additional 

information required to switch excitation filters. 
An excitation signal used to generate 
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a synthesized speech may be preferably stored in the 
adaptive codebook without any modification depending on 
conditions. For this reason, switching of excitation 
filters or changing of filter characteristics is 
5 preferably selected in consideration of the above case 

as well, in which no excitation filtering is performed. 
The present invention is not limited to those described 
above, and various excitation filters can be used. 
By updating the adaptive codebook with excitation 

10 signals having undergone modification by the excitation 

filter, an adaptive codebook that places emphasis on a 
portion exhibiting great contribution to an excitation 
signal can be obtained. 

Synthesized speech can be obtained, which has high 

15 quality as compared with a case where an adaptive 

codebook storing excitation signals without any changes 
is used. 

As has been described above, according to the 
present invention, a speech encoding/decoding method 

20 capable of obtaining a synthesized speech with high 

quality can be obtained. 

Additional advantages and modifications will 
readily occur to those skilled in the art. Therefore, 
the invention in its broader aspects is not limited to 

25 the specific details and representative embodiments 

shown and described herein. Accordingly, various 
modifications may be made without departing from 
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the spirit or scope of the general inventive concept as 
defined by the appended claims and their equivalents. 
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